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ABSTRACT 

The spectral absorption lines in early-type galaxies contain a wealth of information regarding the detailed 
abundance pattern, star formation history, and stellar initial mass function (IMF) of the underlying stellar 
population. Using our new population synthesis model that accounts for the effect of variable abundance ratios 
of 1 1 elements, we analyze very high quality absorption line spectra of 38 early-type galaxies and the nuclear 
bulge of M3 1 . These data extend to 1 fim and they therefore include the IMF-sensitive spectral features Nal, 
Call, and FeH at 0.82/im, 0.86/ira and 0.99/im, respectively. The models fit the data well, with typical rms 
residuals < 1%. Strong constraints on the IMF and therefore the stellar mass-to-light ratio, (M/L) sta rs, are 
derived for individual galaxies. We find that the IMF becomes increasingly bottom-heavy with increasing 
velocity dispersion and [Mg/Fe]. At the lowest dispersions and [Mg/Fe] values the derived IMF is consistent 
with the Milky Way IMF, while at the highest dispersions and [Mg/Fe] values the derived IMF contains more 
low-mass stars (is more bottom-heavy) than even a Salpeter IMF. Our best-fit (M/L) stals values do not exceed 
dynamically-based M/L values. We also apply our models to stacked spectra of four metal-rich gl obular 
clusters in M31 and find an (M/L) stars that implies fewer low-mass stars than a Milky Way (Kroupa 200 1J) IMF, 
again agreeing with dynamical constraints. We discuss other possible explanations for the observed trends and 
conclude that variation in the IMF is the simplest and most plausible. 

Subject headings: galaxies: stellar content — galaxies: abundances — galaxies: early-type 



1. INTRODUCTION 

The stellar IMF plays a central role in many areas of astro- 
physics. It sets the overall stellar mass scale of galaxies, deter- 
mines the amount of energetic feedback following an episode 
of star formation via the ratio of high-to-low mass stars, gov- 
erns the nucleosynthetic history of galaxies, and, more funda- 
mentally, provides insight into the physics of star formation. 

Despite the central importance of the IMF, a comprehensive 
physical theory for its origin and variation with environment 
does not yet exist. It has been argued that the characteris- 
tic m ass scale and shape of the IMF is set by the Jeans mass 
(e.g. lLarsonll 19981 12005 1 ) by f e edback from pro tostars (Islikl 
119951: 1 Adams & Fatuzzolll996t iKrumholzj l20lib . or by the 
distribution o f densities in a supersonically turbulent interstel- 
lar me dium (Padoa n ~et :alJll997t [P adoan & Nordlund 120021: 
iHennebelle & Chab rieril2008t lHopkinsll2012allbh . 

Despite the variety of ideas, every theory predicts at least 
some variation in the IMF with physical properties. Remark- 
ably, observations in our Galaxy of star forming regions, open 
and globular clusters , and field stars have found little varia- 
tion in the IMF (e.g IScaI^ [T^lKr^pdl2001t lBastian et alJ 
120 10t iKroupa et all 1201 lh . The observations are however 
complicated by a myriad of selection effects, biases, and cor- 
rection factors. Observed luminosity functions must be cor- 
rected for binarity, stellar evolution, and, for globular clus- 
ters, dynamical evolution that preferentially ejects low-mass 
stars from the clusters. Moreover, direct constraints on the 
low-mass (M < 0.5M Q ) IMF are limited to relatively mun- 
dane environments. Nonetheless, unambiguous evidence for 
IMF variation from direct star counts does not currently exist. 
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In nearby and distant galaxies, direct estimates of the IMF 
to w 0. IMq from star counts is currently impossible, and, ex- 
cept perhaps for a handful of Local Group galaxies, will re- 
main impossible for the indefinite future. Less direct probes 
of the IMF are therefore required. 

It was recognized in the 1960s that low-mass stars, though 
individually faint, c an be detected in the integrated light of 
early-type galaxies (Spinrad 1962). Early attempts to ex- 
ploit this fact to measure the IMF in early-type galaxies lead 
to a variety of conflicting claims ( Spinrad & Taylor 1 1971 
Cohenlll978l: iFaber & Frenchlll980t [Hardy & Couturelll988 
Couture & Hardvl 119931: [Carter et al.lll986l iDelisle & Hardyl 
1992t ICouture & Hardvl 1 19931) . owing largely to the inade- 
quate quality of the data and models. Thanks largely to the 
increased quality of the stellar interior and atmospheric mod- 
els and data in the intervening years, this technique is now 
capable of providin g powerful constraints on the IMF (e.g., 
ICenarro et ai1l2003h . 

In recent work we built a new population synthesis model 
that allows for arbitrary variation in the IMF, stellar age (for 
ages > 3 Gyr), and the detailed abund ance patterns of the stars 
dConrov & van Dokkuml 120121) . In Ivan Dokkum & Conroyl 
(2010) we used a preliminary version of this new model 
to obtain constraints on the IMF in eight massive early- 
type galaxies in the Virgo and Coma clusters, finding ev- 
idence for an IMF in these galaxies that was much more 
bottom-heavy t han the canon ical Milky Way (MW) IMF 
(IKroupa! 120011: IChabrierl 120031) . Following this work, in 
van Dokku m & Conroyl (120111) we demonstrated that globu- 
lar clusters in M31 with abundance patterns similar to the 
massive galaxies did not show evidence in their spectra for a 
bottom-heavy IMF, which was a critical test of the technique 
because such clusters a re known not to have heavy IMFs from 
dynamical constraints (Strade r et al.ll201 lT) . An increasingly 
bottom-heavy IMF with increasing galaxy mass has also been 
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Table 1 

Model Parameters 



Parameter 


Prior 


Units 


Notes 


z 


<-] 000 1 000^ 


kms~' 




G 


(ZU,4UU) 


i 

kms 


velocity dispersion 


[re/rij 


(-U.4,U.4J 




Iron abundance 


[vJ,iNe,o/rej 


(-U.4,U.o; 




Oxygen, Neon, Sulfur abundance 


[L./rej 


(-U.4,U.4) 




Carbon abundance 


rVT/Cal 

[IN/rej 


/ r\ A n q\ 
(-U.4,U.o) 




Nitrogen abundance 


[iNa/r'ej 


< (\ A 1 1A 




Sodium abundance 


Livig/^ej 






Magnesium abundance 


[Si/Fe] 


(-0.4,0.4) 




Silicon abundance 


[i_a/rej 


(-U.4,U.4j 




Calcium abundance 


[Ti/Fe] 


(-0.4,0.4) 




Titanium abundance 


[Cr/Fe] 


(-0.4,0.4) 




Chromium abundance 


[Mn/Fe] 


(-0.4,0.4) 




Manganese abundance 


age 


(4,15.0) 


Gyr 


Age of bulk population 


log(/v) 


(-5.0,-0.3) 




fraction of young (3 Gyr) stars 


"1 


(0.0,3.5) 




IMF slope over O.1M < M < O.5M 


a 2 


(0.0,3.5) 




IMF slope over O.5M < M < l.OM 




2.3 




IMF slope over l.OM < M < 1OOM 


A(7 eff ) 


(-50,50) 


K 


temperature offset applied to all stars 


log(M7III) 


(-5.0,-0.3) 




fraction of additional M7III light 


log(/hot) 


(-5.0,-0.3) 




fraction of additional hot stars 


Thot 


(1,3) 




Temperature of additional hot stars 


Note. — The 


prior is flat within the range 


defined in the table and cuts off sharply 



outside of the prior range. The parameter 03 is fixed to the value 2.3. 



suggested by lSpiniello et alJ (120121) based on our models and 
data from the Sloan Digital Sky Survey. 

Constraints on the IMF in early-type galaxies from kine- 
matics and gravitational lensing also favor IMFs that be- 
come increasingly bottom-heavy toward higher galaxy masses 
dGrillo et al l 12001 iGnlio & Gobal] l201Qt iTreu et all l20lo[ 
Auger et alfcoiOUSpiniello et al.ll2012tlThomas et alj201 lbt 
Sonnenfeld et al J |20 1 2t ICappellari e t al.N2012l) . These tech- 



niques constrain the IMF by assuming that the stars and dark 
matter do not trace each other perfectly^ This assumption be- 
comes more plausible as the spatial dis tribution of the stars 
becomes less spherical (Cappellari et al.ll2.012b . 

Finally, constraints on the IMF from scaling relations and 
global models of galaxies and dark matter also favor an IMF 
that beco mes increasingly bottom- heavy for more massive 
galaxies dDutton et al.ll20Tlll2012bllal) . 

This paper is the second in a series of papers aimed at mea- 
suring the IMF in early-type galaxies. In van Dokkum & Con- 
roy (2012, hereafter Paper I) we present the sample, discuss 
data reduction techniques, and explore empirical trends in the 
absorption line spectra. In this paper we apply our stellar 
population synthesis model to these data in order to simulta- 
neously measure the stellar IMF, detailed abundance pattern, 
and mean stellar ages on a galaxy-by-galaxy basis. 

We proceed as follows. In Section|2]we briefly summarize 
the observations, and in Section [3] we provide an overview of 
the model. The results are presented in Section^] followed by 
a series of tests in Section [5] a discussion in Section [6] and a 
summary in Section [7] 

2. DATA 

Nearly all of the data analyzed in this paper have been ob- 
tained with the Low Resolution Imaging and Spectrometer 
(LRIS) on the Keck I telescope over the past three years. The 
early-type galaxy sample consists of a stacked spectrum of 

4 The dynamical constraints imply that the mass-to-light ratios are larger 
than expected for a MW IMF, which can be explained either by a bottom- 
heavy (dwarf-dominated) IMF or a bottom-light (remnant-dominated) IMF. 



four massive galaxies in the Virgo cluster (originally from 
van Dokkum & Conroy 2010), 34 galaxies dra wn from the 
SAURON sample of nearb y early-type galaxies (Bac on et al.l 
I200U Ide Zeeuw etail 120021) . and the nuclear bulge of M31. 
For all galaxies except for the stacked Virgo spectrum and 
M31 the spectra are extracted within 1/8 of the effective ra- 
dius, R e . Details of the sample selection and data reduction 
can be found in Paper I. 

We will also analyze spectra of four massive metal-rich 
globular clusters from M31. The r ed spectra for these ob- 
jects were obtained with LRIS by us (Ivan Dokkum & Conrovl 
1201 11) . while the blue spectra were obtained with Hectospec 
by Nelson Caldwell, and kindly provided to us. 



3. MODEL 

We employ our new stellar population synthe sis (SPS) 
model developed in IConrov & van Dokkuml d2012l CvD12), 
with several minor extensions. The core of the model 
is based on two empirical stellar libraries: MILES 
(Sanchez-Blazq uez et al.l 2006) in the o ptical and IRTF 
( ICushing et alJ 120051: iRavner et al.l 120091) in the near-IR, 
at solar metallici ty. Iso chrones from the Dartmouth 
( potter et all 120081) Padoy a dMarigo et al l 120 08), and Lyon 
(Chab rier & Baraffel [19971 iBaraffe et al.l 119981) groups are 
combined to provide high-quality isochrones over the full 
stellar mass range appropriate for older stellar populations. 
SPS models are then constructed for ages ranging from 3 - 
13.5 Gyr and for arbitrary IMFs by combining the isochrones 
with the empirical libraries. Constructing models with 
younger ages was not possible because the IRTF library does 
not currently include hot stars. In CvD12 we also computed 
a grid of fully synthetic stellar spectral libraries in order to 
model the variation of individual elemental abundances. In 
its present version, the model allows for variation in the ele- 
ments C, N, Na, Mg, Si, Ca, Ti, Cr, Mn, and Fe, and 0,Ne,S 
are varied in lock-step. We emphasize that the synthetic spec- 
tra are only used differentially with respect to the empirical 



The Stellar IMF in Early-Type Galaxies. II. Results 



3 



libraries^. The model makes predictions over the full optical 
and near-IR wavelength range of 0.36/ira < A < 2.4/j.m. 

All models that feature simultaneous variation in multiple 
elements are constructed by assuming that the effect of each 
element on the spectrum is independent of the others. For ex- 
ample, a model with [Mg/Fe]=+0.2 and [N/Fe]=+0.2 is cre- 
ated by combining two models, one with only Mg variation 
and one with only N variation. This is an assumption in every 
mod el that includes variation in mu ltiple abundance patterns 
(e.g. JThomas et alJ2003tlSchiavonl2007b . To our knowledge, 
this assumption has not been rigorously tested. We simply 
note it here as a possible systematic uncertainty in the model- 
ing. This issue will be addressed in future work. 

Moreover, we make the standard assumption that the re- 
sponse of the spectrum to variation in an element is linear in 
[e/Fe], where 'e' stands for a generic element. In reality, the 
variation of spectral features with [e/Fe] is not linear in [e/Fe] 
when the range in [e/Fe] is large, especially for strong spectral 
features and for elements that play an important role in the 
atmospheric structure of stars. For most purposes this non- 
linearity is not important, as the derived abundances do not 
vary significantly beyond the abundance range used to create 
the models (for example, our model spectra were computed 
with variations of ±0.3 dex for most elements). As we will 
see in later sections, this is not true for the derived [Na/Fe] 
abundances, which reach as high as 1 dex in our best-fit mod- 
els. We have therefore computed new stellar spectral models 
with [Na/Fe] variation up to 0.9 dex and we use these new re- 
sponse functions in the present model. The use of these mod- 
els, rather than linear extrapolations from the 0.3 dex models, 
results in a derived [Na/Fe] that is lower by s» 0.2 dex for the 
most extreme cases. Moreover, there is some evidence that 
non-LTE effect s will result in s tronger Nal lines compared 
to LTE models dLind et al.ll201 lh . To first order these effects 
should only change the derived [Na/Fe] abundance, but fur- 
ther work will be required to assess whether or not non-LTE 
effects can impact the derived IMF values as well. 

Notice that since abundance ratio variations are grafted onto 
empirical stellar spectra at approximately solar metallic ity, the 
model is only applicable to systems with metallicities not too 
different from solar. The synthetic spectra allow us to cre- 
ate models with non-solar metallicities, but the extrapolation 
becomes less reliable as the metallicity deviates significantly 
from solar. 

The model also allows for variation in the effective tem- 
perature, T e fi, of the individual stars. This was implemented 
by computing new synthetic spectra with different T e s and 
then differentially modifying the empirical spectra. This pa- 
rameter allows us to explore the effect of variation in the 
isochrones with metallicity. For example, a change in [a/Fe] 
by ±0.2 or [Fe/H] by ±0.1 dex results in a roughly 50K 
chang e in the location o f the is ochrones near solar metallicity 
(e.g.. Potter et al.ll2007l 12008). Most SPS models that con- 
sider variable abundance patterns do n ot include the abun- 
dance effects on the iso chrones (e.g.. [Thomas et al.1 120031; 
iGraves & Schiav on 2008; iThomas et al.1 201 lab , wi th the ex- 
ceptio n of the models of ICoelho et al.1 (120071) and iLee et all 
(2009). 

We emphasize that these models are the only ones cur- 
rently available that follow the effect of individual elemental 

5 The synthetic spectra are in fact used in an absolute sense to bridge 
the gap in wavelength coverage between the MILES and IRTF libraries at 
0.74/Mi— 0.8/xm. This wavelength range is not used in the present work. 



abundance variation on the full spectrum. To our knowledge, 
all other models that include variation in individual elemen- 
tal abundances do so only on the effec t of spectral indices , 
principally the Lick index system (e .g., IThomas et al.l 12003k 
ISchiavon et alJl2005tlLee^tani2009l) . 

In CvD12 we considered IMFs that were a single power- 
law over the full mass range. In the present work we con- 
sider a three component power-law with separate indices, ai, 
a.%, and a-}, describing respectively the 0. 1M© < M < 0.5M©, 
0.5M© < M < 1 .0M©, and 1 .0M© < M < 100M© mass inter- 
v als. For referen ce, a Salpeter IMF has a\ = a 2 = 0:3 = 2.3 and 
a Kroupa (2001) IMF has a v = 1.3, a 2 = 2.3, a 3 = 2.3. Here 
we will fit for ct\ and a 2 and will fix = 2.3. Stellar rem- 
nants are inc luded in our M/L va lues according to a standard 
prescription (IConrov et al.ll2009l) . A iKroupal ?2001l) IMF is 
adopted as our reference 'MW' IMF. 

We have also added several additional parameters meant to 
capture the addition of minority stellar populations. In addi- 
tion to the age of th e bulk population, we include a parameter 
to allow 'frosting' (Trageret al. 2000) with a young popula- 
tion with an age of 3 Gyr (the youngest age in our model). 
We also include two parameters describing the contribution 
to the light from hot stars (whether they be young stars or 
hot horizontal branch stars). One of these parameters is the 
temperature of the star (ranging from 1 - 3 x 10 4 K) and the 
second is the fraction of the total flux comprised of these stars. 
Finally, we allow for the addition of arbitrary amounts of M 
giant light (specifically an M7 giant). For our purposes we 
consider these as nuisance parameters, as our main goal is to 
measure the IMF from integrated light. 

The fiducial model is characterized by 19 parameters: 11 
for the abundance pattern, one for the age, one for T e ff offsets, 
two for the IMF, two for the hot star component, one for the 
young component, and one for the fraction of additional M 
giant light. In addition to these 19 parameters, we also simul- 
taneously fit for the velocity dispersion and redshift of each 
galaxy, bringing the total number of parameters to be fit to 21 . 
These parameters are summarized in Table Q] 

The native resolution of the model is not constant in ve- 
locity. In order to facilitate comparison to the data, we have 
broadened the models to a constant velocity dispersion of 
100 kms -1 . We note that the observed spectra have also been 
convolved to a resolution that is constant in velocity, with a 
dispersion of 100 kms" (see Paper I for details). 

3.1. Fitting Procedure 

We adopt a Markov Chain Monte Carlo (MCMC) fitting 
technique in order to efficiently explore the large parameter 
space of the model. In the MCMC algorithm, a step is taken 
in parameter space and this step is accepted if the new loca- 
tion has a lower \ 2 compared to the previous location, and 
is accepted with probability e" Ax "/ 2 if the \ 2 lS higher than 
the previous location. Each step in the chain is recorded. 
After a sufficient number of steps the likelihood surface pro- 
duced by the chain will converge to the true underlying like- 
lihood. Con vergence is defi ned according to the prescription 
described in lDunkley et al.l (120051) . We have found that ~ 10 s 
steps are required in order to achieve convergence in all pa- 
rameters. The 'burn-in' region, where the chain is descending 
to the minimum of x 2 , is removed before analyzing the chain, 
as is standard practice. 

Priors on the model parameters must be specified in any 
MCMC algorithm to keep the chain from wandering into un- 
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Figure 1. Comparison between the spectrum of NGC 4621 and the best-fit model. Within each plotted wavelength interval, continuum-normalized fluxes are 
shown in the top panel and the ratio between model and data are shown in the bottom panel. The grey shaded bands demarcate the noise limits of the data. The 
data at A < OAfim are not used in the fit; the grey shaded bands become very large in that wavelength range to highlight this fact. The light grey spectral regions 
are masked from the fit because of possible emission line contamination. NGC 4621 has one of the most bottom-heavy IMFs in our sample. 



physical or unrealistic regions of parameter space. Our choice 
of priors on each parameter are specified in Table [T] Outside 
of the prior range, \ 2 is penalized by a Gaussian with a width 
of a p = 0.01. The adopted priors have little impact on our 
results because the parameters are always well-constrained 
within the prior range. 

In the present work we are most interested in the infor- 
mation contained in narrow spectral features. Therefore, 
we compute x 2 after dividing the model and data spectrum 
by a high-order polynomial. Operationally, we split the 
spectrum into four wavelength intervals: OAfim- 0.46/ira, 
0.46/ira- 0.55/im, 0.80/ira-0.89/ira, 0.96/x/w— 1.02/xm, and 
divide each sub-region by a polynomial of degree n where 
n = (A max - A m in) / 1 00A. Variation in n by ±2 induces changes 
to the spectrum at the < 0.1% level. 

The wavelength range 0.89-0.96/ira is not included be- 
cause the sky absorption corrections are significant in that 



wavelength interval (see Paper I for details). While both the 
models and data extend to A < QA/im, the Call H&K lines 
at 0.39/ira are not well-modeled at the sub-percent level be- 
cause these features form at very low densities in the stellar 
atmosphere. They are thus subject to non-LTE and 3D ef- 
fects, which are not included in the calculation of our syn- 
thetic spectra. As our goal is to model the observed spec- 
tra at the sub-percent level, we ignore the very blue end of 
the spectra. Nonetheless, the best-fit models, constrained at 
A > OAfim, do provide a good fit to the data at A < 0.4/xra, 
with typical residuals at the several percent level. Examples 
of the quality of fit in the very blue are given in Figures QJand 
|2] discussed below. 

We emphasize that the information contained in the broad- 
band shape of the spectral energy distribution is not used in 
the fitting procedure. We choose this approach because it is 
very difficult to achieve accurate flux calibration over a large 
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Figure 2. Comparison between the spectrum of NGC 524 and the best-fit model; see FigurefT|for details. NGC 524 has an inferred IMF similar to the MW. 



wavelength baseline at the percent level. 

We mask the spectral regions surrounding the H/3, [OIII] 
and [NI] emission lines and any spectral region contaminated 
by cosmic rays. 

4. CONSTRAINTS ON THE IMF 

In this section we present constraints on the stellar IMF for 
a sample of early-type galaxies, the nuclear bulge of M31, 
and globular clusters. Other parameters derived from our stel- 
lar population synthesis model, including detailed abundance 
patterns, will be presented in future work. 

4.1. Early-Type Galaxies 

A typical fit to the continuum normalized spectrum of an 
early-type galaxy is shown in Figures Q]and[2]for NGC 4621 
and NGC 524, respectively. In these figures we compare the 
observed spectra to the best-fit models. We also plot the ra- 
tio between the model and data and compare to the noise-to- 
signal ratio of the data. In an Appendix we show the fits to 
the other 33 galaxies from Paper I. 



In Table [2] we provide the mean S/N of the spectrum (aver- 
aged over the wavelength range used in the fit), the rms resid- 
ual between the best-fit model and data, the minimum x 2 /dof, 
the best-fit velocity dispersion, [Mg/Fe], and [Fe/H], the best- 
fit mass-to-light ratios in the r, I and /f-bands allowing for 
IMF variati on, and the be st-fit mass-to-light ratio assuming a 
fixed MW ( lKroupall200lb IMF. All quantities are measured 
within an effective circular aperture of radius R e /8 except for 
M31, which is within the central 4" = 15 pc. 

In Figure[3]we show the covariance between several derived 
parameters and the mass-to-light ratio, M/L, for NGC 4621. 
The latter quantity is shown in units of the mass-to-light ratio 
assuming a MW IMF, (M/L) M w- Constraints on the IMF will 
be plotted in this way throughout the paper in order to isolate 
the effects of the IMF on M/L from the effects of age, [Z/H], 
etc. The most important point to take from this figure is the 
very weak or absence of a correlation between the IMF and 
other parameters. We have inspected a much greater cross- 
section of parameter space than what is shown in Figure [3] 
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Figure 3. Covariance between various parameters and the normalization of the IMF. The latter is displayed as a ratio between the measured M/L and the M/L 
assuming a MW IMF. Results are shown for the fit to NGC 4621. The contours represent the 68% and 95% confidence limits (red and black lines, respectively). 
The symbols mark the median of the marginalized likelihoods. Notice that there is weak or no correlation between the various parameters and the IMF, and that 
each parameter is very well constrained. 
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Figure 4. Comparison between [Fe/H] and [Mg/Fe] values derived in this 
work and in K10. The formal statistical errors on our measurements are often 
smaller than the symbol size. Dashed lines represent constant total metallic- 
ity, [Z/H]. The upper and left panels show differences between our and KlO's 
measurements as a function of our values. 

and find in general very little correlation between parameters, 
indicating that each parameter is well-constrained by the data. 
It is also evident from this figure that the derived M/L is much 
larger than expected for a MW IMF. Indeed, NGC 4621 has 
one of the steepest IMFs in our sample (a Salpeter IMF is 
w 60% heavier than a MW IMF, and so NGC 4621, with an 
M/L that is twice as heavy as a MW IMF, has an IMF even 
steeper than Salpeter). 

NGC 4621 also has one of the highest [Na/Fe] values in 
our sample. The only sodium line being fit is the Nal dou- 
blet at 0.82/ira; the NaD feature at 0.59/ira is not covered 
in the observed spectra. The constraint on [Na/Fe] is there- 



fore coming from a combination of the feature at 0.82/jm and 
the fact that the sodium abundance has an indirect effect on 
the entire spectrum due to its influence on the free electron 
abundance in stellar atmospheres (see CvD12 for details). It 
is important to notice that even when allowing the [Na/Fe] 
abundance to reach very high values, a bottom-heavy IMF is 
still favored for this particular galaxy. In other words, the 
very strong Nal feature is due both to high [Na/Fe] abundance 
and a prep onderance of low-ma ss stars. A similar result was 
found by Spinie llo et alj d2012l) who analyzed the spectra of 
two higher-redshift early-type galaxies. Spectra that cover the 
NaD feature will be very valuable for placing upper limits on 
the [Na/Fe] abundance. Of course, the NaD feature can only 
place an upper limit on the [Na/Fe] abundance because this 
feature may also be influenced by absorption from gas in the 
interstellar medium. 

Before considering constraints on the IMF for the full sam- 
ple, we first compare our best-fit [Fe/H] and [Mg/Fe] abun- 
dances to the [F e/H] and faZFe] abundan ces derived for the 
same galaxies by Kuntschner et al. (2010, K10). The latter are 
based on the SPS model of Sc hiavonl d2007l) . This comparison 
is shown in Figure [4] In K10 all of the a elements track each 
other (except for Ti), so [Mg/Fe]=[a/Fe] in that work. Notice 
that the comparison is only approximate because K10 derive 
abundances within true circular apertures with radii equal to 
R e /8, while we derive abundances within a slit of radius R e /% 
with a radial weighting meant to mimic a circular aperture 
(see Paper I). If the galaxies were perfectly smooth in the az- 
imuthal direction, then the two approaches would yield identi- 
cal spectra. However, it is clear from the 2D stellar population 
maps in K10 that there is significant variation in the stellar 
populations at fixed radius. In any event, the iron abundances 
and a-enhancements generally agree to within < 0.1 dex (the 
scatter between the two methods is 0.05 and 0.07 for [Mg/Fe] 

6 K10 quote total metallicity, Z, rather than [Fe/H]. We have converted 
their results to [Fe/H] via the equation [Fe/H]=Z-0.75[ct/Fe]. This relation 
was derived from tabulated values of [Fe/H], Z, and [ct/Fe] kindly provided 
to us by R. Schiavon. 
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Figure 5. Best-fit M/Lk in units of the best-fit M/Lk for a MW IMF. This quantity is sensitive only to the IMF. Results are shown as a function of a (left panel) 
and [Mg/Fe] (right panel). All galaxies would lie on the dotted line if the IMF was universal and of th e MW form. The red symbol is the result for the stacked 
spectrum of four massive early-type galaxies from the Virgo cluster studied in van Dokkum & Conrov (2010), the orange symbol is M87, and the blue symbol is 
M3 1 . The Spearman correlation coefficient, p, is included in the legend of each panel. The data therefore support a scenario wherein the IMF becomes steeper 
with increasing a and/or [Mg/Fe] . 



and [Fe/H], respectively), which is encouraging considering 
the different apertures and modeling techniques. It is also ev- 
ident from Figure |4] that there is an anti-correlation between 
[Fe/H] and [Mg/Fe] such that the total range in metallicity, 
[Z/H], is only « 0.2 dex. 

The principal result of this paper is shown in Figure |5] In 
this figure we show the /T-band mass-to-light ratio, M/Lk 
normalized to the mass-to-light ratio expected for a MW IMF. 
This quantity is directly related to the IMF and is plotted as 
a function of galaxy velocity dispersion, a, and [Mg/Fe]. Er- 
ror bars are marginalized 68% confidence limits. The most 
important conclusion to draw from this figure is that signifi- 
cant galaxy-to-galaxy variation in the IMF is inferred, ranging 
from IMFs slightly lighter than the MW, to IMFs significantly 
heavier than the MW. In these units a Salpeter IMF has a value 
of k 1 .6, so there are galaxies that have inferred IMFs steeper 
even than Salpeter. 

In Figure q\ we also see clear evidence for correlations be- 
tween the IMF and a and between the IMF and [Mg/Fe]. The 
Spearman rank correlation coefficients, p, are 0.58 and 0.81, 
for the IMF correlation with a and [Mg/Fe], respectively. 
[Mg/Fe] is an indicator of the star formation timescale, and 
the fact that the IMF appears to correlate more strongly with 
this quantity than with a suggests that the intensity of star for- 
mation on kpc scales may play a key role in determining the 
shape of the IMF (see also S mith et aT]|2012l who reach simi- 
lar conclusions based on analyzing stacked spectra of g alaxi es 
in the Coma cluster). We return to this topic in Section 1631 

Figure [6] shows the best-fit M/Lk ratios as a 
function of the total met allicity, [Z/H], es timated as 
[Z/H]=[Fe/H]+0.94[Mg/Fe] dThomas et al.ll2003l) . The cor- 
relation here is much weaker than with either a or [Mg/Fe] 
(p = 0.33), suggesting that total metallicity is not the funda- 
mental variable driving IMF variation. The lack of correlation 
between [Z/H] and IMF is also an important diagnostic for 
any potential model systematic uncertainties that may corre- 
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Figure 6. Best-fit IMF (plotted as in Figure [5J as a function of the best-fit 
total metallicity, [Z/H]. The Spearman correlation coefficient, p, is included 
in the legend. The correlation between IMF and [Z/H] is much weaker than 
between either <r or [Mg/Fe], suggesting that total metallicity is not the key 
variable driving IMF variation. This plot also implies that any model un- 
certainties that correlate with [Z/H] are unlikely to affect the qualitive IMF 
variation displayed in Figure|5] 

late with metallicity, as we discuss further in i j6.ll 

4.1.1. Quality of Fit 

Our constraints on the IMF are derived by marginalizing 
over 19 other parameters, including 11 parameters character- 
izing the detailed abundance pattern and four nuisance param- 
eters meant to capture uncertain aspects of stellar population 
synthesis modeling. Despite this high degree of flexibility, the 
error bars for most galaxies are inconsistent with a MW IMF. 
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Table 2 

Results from Stellar Population Synthesis Modeling 



Galaxy 


(S/N) 


rms (%) 


X 2 /dof 


<t (km s 1 ) 


[Mg/Fe] 


[Fe/H] 


M/L r 


M/L, 


M/L K 


(M/L k )mw 


M31 


439 


0.86 


10.21 


170 


0.19 


0.03 


6.17 


3.69 


1.20 


0.89 


NGC 474 


219 


0.76 


2.13 


178 


0.17 


-0.05 


2.98 


1.88 


0.69 


0.58 


NGC 524 


259 


0.82 


3.11 


260 


0.22 


-0.10 


3.80 


2.36 


0.85 


0.78 


NGC 821 


266 


0.69 


2.55 


216 


0.20 


-0.05 


3.71 


2.30 


0.81 


0.68 


NGC 1023 


440 


0.63 


6.14 


217 


0.18 


-0.01 


5.26 


3.20 


1.04 


0.68 


NGC 2549 


371 


0.76 


6.28 


157 


0.12 


0.07 


1.92 


1.22 


0.42 


0.63 


NGC 2685 


189 


0.95 


2.27 


96 


0.13 


-0.03 


1.80 


1.16 


0.41 


0.50 


NGC 2695 


221 


0.75 


2.11 


210 


0.25 


-0.09 


5.81 


3.56 


1.28 


0.76 


NGC 2699 


180 


0.81 


1.72 


154 


0.15 


0.03 


3.80 


2.39 


0.87 


0.66 


NGC 2768 


239 


0.89 


2.97 


222 


0.19 


-0.09 


2.98 


1.85 


0.64 


0.67 


NGC 2974 


260 


0.75 


2.90 


247 


0.20 


-0.06 


5.22 


3.21 


1.16 


0.80 


NGC 3377 


352 


0.77 


5.88 


140 


0.19 


-0.06 


2.52 


1.60 


0.57 


0.49 


NGC 3379 


431 


0.65 


5.96 


225 


0.24 


-0.08 


5.58 


3.40 


1.12 


0.70 


NGC 3384 


432 


0.71 


7.74 


168 


0.13 


0.05 


2.08 


1.30 


0.41 


0.65 


NGC 3414 


264 


0.84 


3.89 


243 


0.23 


-0.09 


4.92 


3.00 


1.04 


0.79 


NGC 3608 


271 


0.76 


3.25 


194 


0.22 


-0.10 


4.31 


2.67 


0.93 


0.70 


NGC 4262 


227 


0.68 


1.78 


214 


0.27 


-0.10 


6.96 


4.21 


1.45 


0.78 


NGC 4270 


132 


0.80 


0.88 


150 


0.08 


-0.01 


2.38 


1.54 


0.60 


0.59 


NGC 4278 


286 


1.05 


6.23 


273 


0.32 


-0.25 


8.68 


5.08 


1.61 


0.87 


NGC 4382 


328 


0.66 


4.18 


184 


0.11 


-0.02 


1.05 


0.69 


0.25 


0.40 


NGC 4458 


167 


0.97 


2.08 


116 


0.26 


-0.19 


3.75 


2.37 


0.94 


0.56 


NGC 4459 


269 


0.71 


2.70 


186 


0.12 


0.00 


1.94 


1.26 


0.46 


0.57 


NGC 4473 


270 


0.58 


2.03 


194 


0.20 


-0.01 


4.56 


2.82 


1.01 


0.75 


NGC 4486 


244 


0.82 


3.17 


385 


0.33 


-0.16 


9.82 


5.66 


1.71 


0.90 


NGC 4546 


286 


0.73 


3.04 


233 


0.24 


-0.03 


7.16 


4.30 


1.40 


0.78 


NGC 4552 


329 


0.68 


3.90 


271 


0.26 


-0.00 


8.41 


4.97 


1.55 


0.76 


NGC 4564 


238 


0.76 


2.52 


175 


0.22 


0.04 


5.14 


3.10 


0.96 


0.73 


NGC 4570 


229 


0.67 


1.82 


194 


0.20 


0.02 


7.19 


4.27 


1.37 


0.79 


NGC 4621 


343 


0.68 


4.30 


232 


0.25 


-0.02 


7.20 


4.34 


1.39 


0.71 


NGC 4660 


240 


0.64 


1.79 


212 


0.24 


0.00 


4.50 


2.74 


0.89 


0.73 


NGC 5308 


167 


0.60 


0.82 


241 


0.24 


-0.07 


4.76 


2.93 


1.03 


0.73 


NGC 5813 


204 


0.86 


2.33 


233 


0.28 


-0.12 


4.62 


2.85 


1.04 


0.76 


NGC 5838 


231 


0.62 


1.46 


290 


0.21 


0.01 


5.86 


3.57 


1.21 


0.73 


NGC 5845 


166 


0.57 


0.70 


263 


0.22 


-0.00 


5.38 


3.29 


1.09 


0.74 


NGC 5846 


192 


0.87 


1.95 


246 


0.26 


-0.07 


7.23 


4.34 


1.46 


0.82 



Note. — The typical fractional error on M/L is 7% (statistical). Tests performed in Section d31 suggest that the 
systematic errors on M/L are not larger than 50%. The typical fractional error on (M/L)mw is 2% (statistical). The 
formal errors on [Mg/Fe] and [Fe/H] are < 1%. All quantities are measured within an effective circular aperture of radius 
S e /8 except for M31, which is within the central 4" = 15 pc. 



Nonetheless, we might also ask how well-fit are the data 
by models in which the IMF is fixed to the MW value for 
all galaxies. We have thus re-fit each galaxy in our sample 
with a model that is identical to the fiducial mode l except 
that th e IMF is fixed to the MW value (specifically a lKroupal 
1200 ll IMF). Inspection of the best-fit models confirms our 
expectation that the Wing-Ford band at 0.99/ira is a power- 
ful IMF diagnosti c when considered in conjunct ion with other 
spectral features (IConroy & van Do kkum 2012). In contrast, 
the Nal feature at 0.82/im can be reasonably well-fit with a 
model without IMF variation but with higher [Na/Fe] abun- 
dance. The quality of the fit over the full spectral range for 
such models is, however, poorer (see below). 

As an example of the quality of the fits, we show in Figure|7] 
a zoom-in on the Wing-Ford band for the two galaxies shown 
in Figures 1 and 2. In this figure the data are compared to two 
models, one in which the IMF is allowed to vary in the fit, and 
a second in which the IMF is fixed to the MW value. In the 
case of NGC 4621, where the best-fit IMF is significantly dif- 
ferent from the MW IMF, we see that a fixed IMF model fails 
to capture the depth of the observed Wing-Ford band. This 
suggests that the Wing-Ford feature can be a strong discrim- 
inant between fixed and variable IMF models. In the case of 
NGC 524 there is essentially no difference between the pre- 
dictions of the fixed and variable IMF models. This occurs 
because the best-fit variable IMF model returns a MW IMF 



for this galaxy. Nonethelss, both models predict a Wing-Ford 
band that is slightly too weak compared to the data for this 
galaxy. Formally the fits are acceptable, and we remind the 
reader that the data points are highly covariant owing to the 
velocity broadening of the spectrum. 

The improvement in the quality of the fits when a variable 
IMF model is considered can be quantified by the difference 
in the minimum of \ 2 - Figure|8]shows A(x 2 ) between models 
that do and do not allow for variation in the IMF as a function 
of the difference in the best-fit M/L between the two models. 
The latter quantity is in units of the uncertainty in M/L. Se- 
lected galaxies are labeled in the figure. There is a clear trend 
in the sense that galaxies with mass-to-light ratios that devi- 
ate strongly from what is expected for a MW IMF have large 
A(x 2 )- This of course is not surprising, as it is precisely the 
X 2 value that the fitting routine uses to determine the best-fit 
M/L ratios. 

Quantifying the prefere nce of one m odel over another is 
notoriously complex (see iLidd lel 12007 , for a brief review). 
Among the most popular metrics are the Akaike and Ba yesian 
information criteria (the AIC and BIC, respectively; Akaike 
[19741 IScwarj[l978l) . The former is defined as AIC= x 2 + 2k 
where k is the number of free parameters. The AIC attempts 
to balance a change in x 2 against increased model complex- 
ity. In our case the two models under consideration differ by 
two parameters, and so an equivalence in AIC would corre- 
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Figure 7. Zoom-in on the IMF-sensitive Wing-Ford band for NGC 4621 
(top) and NGC 524 (bottom). These are the two galaxies shown in Figures 
1 and 2. The observed spectrum (black line and grey band encompassing 
the \a errors) is compared to several best-fit models. The fiducial model, 
which fits the full wavelength range and includes parameters for the IMF 
(blue line), is compared to a model where the IMF is fixed to the MW value 
(red line). The Xmin ' s shown for each model fit, computed over the range 
0.988^-0.997^?!. The spectrum of NGC 462 1 is poorly fit by a MW IMF, 
while the spectrum of NGC 524 is equally well-fit by a fixed or variable IMF 
model. Notice that the best-fit models for NGC 524 are formally acceptable 
fits to the data around the Wing-Ford band, despite the appearance that the 
models slighly underpredict the feature strength. 

spond to A(x 2 ) = 4. According to Jeffreys' scale, A(AIC) > 5 
is judged as 'strong' evidence for one mo del over a nother, 
while A(AIC) > 10 is judged as 'decisive' (lLiddlel2007l) . We 
can therefore associate A(x 2 ) > 14 as a threshold for decisive 
preference for the model with IMF variation over the model 
without IMF variation. This threshold is included in Figure[H] 
Notice that every galaxy with > 3er evidence for an IMF dif- 
ferent from that of the MW has A(x 2 ) > 14, indicating that, 
in the context of our model, the variable IMF model is very 
strongly (decisively) preferred. We have not considered the 
BIC herein because it depends on the number of data points 
and assumes that the data points are independent. This con- 
dition is clearly violated in our case because the velocity dis- 
persion of each galaxy is larger than the wavelength sampling; 
adjacent data points are therefore highly correlated. 

It is apparent that the observed CaT feature is not very well- 
fit by the models in Figures Q] and [2] especially in the cores of 
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Figure 8. Difference in the minimum of \ 2 between models with and with- 
out variation in the IMF, as a function of the difference in best-fit M/Lx in 
units of the uncertainty. Symbol colors are as in Figure H] The two open 
symbols have A(x 2 ) < and so for these galaxies we plot |A(x 2 )|- The 
model with IMF variation has two additional degrees of freedom compared 
to the fixed IMF model. In the context of the AIC, the variable IMF model is 
'decisively preferred for A(x 2 ) > 14; see the text for details. 

the lines. This issue is explored further in Figure In the left 
panels we show residuals between the best-fit model and data 
around the CaT feature for all 35 galaxies, split according to 
their M/L ratio. Also included is the mean residual for the 
entire sample (red lines) and the mean residuals within each 
panel. Comparison of the green and red lines in these pan- 
els suggests that the CaT feature is poorly fit for all galaxies, 
and the quality of the fit does not vary much with M/L. In 
other words, there appears to be an overall offset between the 
models and the data, independent of the best-fit IMF. Recall 
that the [Ca/Fe] abundance is included in the model fits, and 
is well-constrained by the Cal feature at 0.4227/im. The off- 
set between the models and data is therefore unlikely to be a 
calcium abundance effect. 

The tension between the observed and modeled CaT is dis- 
played in another way in the right panel of Figure |9] Here 
we show the equivalent width (EW) of the CaT feature (see 
IConrov & van D okkum 2012, for the definition of this index) 
for both the models and the data as a function of the best-fit 
M/L ratio. The overall trend of decreasing CaT strength with 
increasing M/L ratio is expected because the CaT is a giant- 
sensitive feature. The offset between the models and data is 
also apparent, and the offset is approximately constant with 
M/L. The origin of this offset is unclear. It may be due to 
the fact that the CaT absorption lines form at very low Rosse- 
land optical depths, and so the model becomes sensitive to the 
treatment of the tenuous outer atmospheres of model giants. It 
may also reflect an underlying error in the construction of the 
base set of empirical models onto which the synthetic spectra 
are grafted. Regardless of the origin, the most important point 
is that the offset is almost completely independent of M/L, 
and implies that this tension between the model and data is 
unlikely to be driving the qualitative variation in the IMF that 
we observe. 
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Figure 9. Comparison of models and data around the CaT feature. Left Panel: Residuals between the best-fit model and data for galaxies split according to their 
best-fit IMF (light IMFs in the top panel, heavy IMFs in the bottom panel). The green lines are the mean residuals within each panel, and the red line is the mean 
residual for the entire sample. Dotted lines mark the location of the CaT features. Right Panel: Comparison of the CaT EW between the best-fit models and the 
data as a function of the best-fit IMF. Solid lines are a linear fit to the trends. Notice that while the best-fit models are offset low compared to the data, the trend 
of CaT strength with the inferred IMF is similar between the models and the data. These panels suggest that there is an overall offset in the predicted strength of 
the CaT feature in the current version of our models. 



4.2. The Nuclear Bulge ofM31 

The form of the IMF in the nuclear bulge of M31 was 
the subject of i n tense debate in the 1970s and 1980s. 
ISpinrad & Taylor! (119711) observed the Nal 0.82/xm fea- 
ture and concluded that the bulge was dwarf-dominated 
((M/L) stals = 44). A large number o f subsequent observations 
have l ead to conflicting results (e.g..|OconnelJl976tlWhitfordl 
19771: iFrogel et allll978tlCohenlll978HFaber & Frenchl ll980l 
Carter et all 119861 iDelisle & Hardyl 119921) . There is broad 



agreement that the Nal feature is quite strong, especially in 
the nuclear region (the inner 2-4"). The debate centered 
on whether this enhancement is due primarily to an increase 
in the dwarf star contribution or an increase in the [Na/Fe] 
abundance. If interpreted as the form er, the implied mass - 
to-light ratio would be M/L B « 28 dFaber & Frenchl 11980b . 
which would be in strong conflict with dynamical constraints 
(iSagliaet alJl20Toh . 

In light of the controversial history of the nuclear bulge of 
M31 we decided to include M31 in our analysis. The LRIS 
spectrum of M31 was extracted within a radius of 4", which 
corresponds to the central 15 pc of M3 1 . We fit our SPS model 
to this spectrum in a manner identical to the early-type galaxy 
data. The resulting best-fit M/Lk is shown as a blue symbol 
in Figure[5] We derive a mass-to-light ratio that is in between 
a MW IMF and a Salpeter IMF, and is highly inconsistent 
with dwar f-ric h (bottom-heavy) IMFs. As will be discussed 
in Section lBTTl our derived mass- to-light ratio for M 3 1 agrees 
well with dynamical constraints (Saglia et al. 201(3). 

The nuclear spectrum shows an abundance pattern typical 
of the early-type galaxy sample. It is a-enhanced in all the 
a elements except for Ca and Ti, and also enhanced in N and 
C. The derived sodium abundance is however one of the high- 
est of the sample, at [Na/Fe] w 1.0. More detailed models and 
coverage of the NaD spectral feature will shed further light on 
the sodium abundance in M3 1 and other galaxies. Removal of 



the Nal feature results in a change in M/Lk of -50%, while 
removal of both the Nal and CaT features results in a change 
in M/Lk of +50%. These changes are typical of the sys- 
tematic unc ertainties in our present modeling technique (see 
Section |5T2"l for details). 

The basic result from this section is that the nuclear re- 
gion of M3 1 is entirely consistent with a normal IMF. We can 
certainly rule out extreme ma s s-to- light ratios, such as those 
suggested by iFaber & French! ( 1980) based on data obtained 
within a slightly larger aperture (2" x 4" compared to an 8" 
circular diameter herein). We derive much lower M/L ratios 
compared to Faber & French in part because we allow for 
large [Na/Fe] variation (reaching nearly 1 dex for the nuclear 
spectrum of M31), and in part because the FeH feature places 
a strong upper limit on the allowed M/L values. 

4.3. Metal-Rich Globular Clusters in M31 

We now turn to constraints on M /L for four metal-rich glob- 
ular clusters (GCs) in M3 1 . These clu sters were selected to 
be metal-rich, a-enhanced, and old dCaldwell et al.l |20T11) . 
They have stellar masses in the range 5.5 < logM/M Q < 6.1, 
based on dynamica l constraints, and long relaxation times 
(Strader et al. 201 1). One of the most intriguing facts of these 
clusters is that their /^-band mass-to-light ratio s are a fac- 
tor of ~ 2 lighter than expected for a MW IMF ( Strad er et al.l 
1201 ll) . GCs are simpler than galaxies in the sense that the stars 
within them are approximately c oeval, and they are k nown to 
contain little or no dark matter (IConrov et al.ll201ll and ref- 
erences therein). Comparison between dynamical and stellar 
population-based constraints on M/L is therefore straightfor- 
ward. 

We have fit the stacked spectrum of these four clusters with 
our model. The resulting best-fit A'-band mass-to-light ra- 
tio is shown in Figure [10] plotted versus the best-fit [Mg/Fe] 
abundance. We also include our sample of early-type galaxies 
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Figure 10. M/Lk versus [Mg/Fe] for the early-type galaxies analyzed in 
the present work (grey symbols) and for the stacked spectrum of four metal- 
rich globular clusters in M3 1 (red symbol; statistical error bars are smaller 
than the symbol size). We also compute and include in the figure the mass- 
to-light ratio computed without black hole and neutron star stellar remnants 
(green symbol). The d ynamically-based M /Lk for the four M31 clusters is 
shown as a solid band (Strader et al. 201 1). The M/Lk for a 13 Gyr, solar 
metallicity MW IMF model is 0.9, implying that these globular clusters are 
lighter in mass than expected for a MW IMF, and consistent with dynamical 
constraints. 

that have ages > 10 Gyr. 

The standard model M /Lk ratio, which includes a contri- 
bution from stellar remnants, is shown as a red symbol. We 
have also computed a mass-to-light ratio without inclusion of 
black hole and neutron star stellar remnants (green symbol) . 
A paucity of remnants in GCs may arise because the shallow 
potential well of the GCs permits the escape of massive rem- 
nants when the progenitor star explodes. In any event, the 
effect of black holes and neutron stars on the mass-to-light 
ratio is small. 

Dynamical m ass-to-light ratios h ave been computed for 
these clusters by IStrader et al.l (1201 lb : the full range spanned 
by these four clusters is shown as a band in Figure [10] For 
reference, a 13 Gyr, solar metallicity MW IMF model has 
M/L K = 0.9. 

Both the stellar population-based and dynamically-based 
mass-to-light ratios point toward a bottom-light IMF in these 
GCs. This broad agreement between two completely indepen- 
dent techniques is very encouraging, and lends support to our 
model results. 

5. TESTS 

In this section we discuss several tests of our principle result 
that the IMF is not universal. We confro nt ou r best-fit mod- 
els with dynamical constraints in Section I5TI and we di scuss 
several possible systematics in the model in Section l5^2l 

5.1. Comparison to Dynamical Masses 

In this section we compare mass-to-light ratios estimated 
from our models to dynamical estimates. For the 34 galaxies 
drawn from the S AURON survey, dynamical mass-to-light ra- 
tios are bas ed on Jeans axisymmetric modeling of the 2D ve- 
locity field dSco tt et al. 2009). For these galaxies the dynami- 
cal estimates are quoted within R e . We have adopted a typical 
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M/L Iidyn 

Figure 11. Comparison between SPS-based and dynamically-based M/L;. 
Notice that the SPS-based mass-to-light ratios never violate the dynamical 
constraints. M31 is shown separately (blue symbol) because the dynamical 
constraints on M/L come from a different source and technique than the rest 
of the sample. The M/L] gps ratio for a MW and a Salpeter IMF is 2.2 and 
3.4, respectively, assuming an age of 13 Gyr and solar metallicity. 

uncertainty on the d ynamical M/L values of 6% following 
Cappellari^eLalJ 42006D. 

The comparison between our stellar population-based 
mass-to-light ratio, M/Lsps, and the dynamically-based 
M/Ldyn is shown in FigureQT] Mass-to-light ratios are quoted 
in the /-band in order to provide a direct compariso n with re- 
sults from the SAURON survey dScott et al .1120091) . Notice 
that the SPS and dynamically-based mass-to-light ratios are 
obtained within different physical apertures (/? e /8 for the for- 
m er and R e for the la tter) . 

Sagl ia et all (|2010) have estimated the dynamical mass-to- 
light ratio for M3 1 in the R-band based on long-slit data. We 
have assumed (M /Lr)/(M/Lj)=1.3, appropriate for a 13 Gyr 
single-age population with solar metallicity, in order to trans- 
late their value into M/L/. Saglia et al. provide M/Lr as a 
function of radius; we adopt a dynamical mass-to-light ratio 
within 4" for comparison to our SPS-based value. 

The SPS-based M/L values are always less than or equal 
to the dynamical M/L values within the 2cr errors, which is 
remarkable given the small size of the SPS-based errors. Even 
in cases where we infer bottom-heavy IMFs, the derived M/L 
values do not exceed the dynamical constraints. There are 
galaxies for which the SPS-based M/L ratios are less than 
the dynamically-based ones. In these cases there is room for 
additional, non-stellar mass (i.e., dark matter). 

5.2. Systematics 

In this section we explore several variations to our fiducial 
model. The variations are of two types: changes to the spec- 
tral range included in the fit, and changes to the properties of 
the stellar population model. We consider the effect of these 
variations on the derived M/Lk ratios. 

We begin by considering variation in the spectral range in- 
cluded in the fits. In the top panels of Figure Q~2] we consider 
the effect on M /Lk of removing one or more classic IMF- 
sensitive spectral features. In the left panel we remove the 
Nal spectral region (0.81/im-0.83/i/«). In the middle panel 
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Figure 12. Exploration of systematic effects on the derived mass-to-light ratios. In each panel, the M/Lk is plotted in units of (M/Lk)mw (as in previous 
figures). Dashed lines indicate a slope change of ±50%. The open symbols in each panel indicate galaxies with ages younger than 10 Gyr. The upper panels 
highlight the effect of removing one or more spectral regions from the fit. In the left panel the Nal feature is removed, in the middle panel both the Nal and 
CaT features are removed, and in the right panel the Wing-Ford band spectral region is removed. Neglecting certain IMF-sensitive features affects the overall 
normalization of the M/Lk values at the as 50% level, but relative trends are robust. The lower panels highlight other model permutations. In the left panel we 
turn off the spectral variation due to the IMF in the blue spectral region (A < 0.8/im), in the middle panel we turn off blue IMF variation and remove Nal and CaT 
from the fits, and in the right panel the three nuisance parameters are not included in the fit. In the middle panel the only constraints on the IMF are coming from 
the Wing-Ford band spectral region. These tests suggest that the absolute M/Lk ratios carry systematic uncertainties at the < 50% level, and that the Wing-Ford 
band alone is likely not sufficient to place strong constraints on the IMF over the range of IMFs probed. 



we remove both the Nal and CaT features (0.81/zm— 0. 83 jttm 
and 0.845//m-0.87/im). In this case the Wing-Ford band is 
the only classic IMF-sensitive feature included in the fit. In 
the right panel we remove from the fit all information beyond 
A > 0.96/im, which includes the Wing-Ford band. 

Qualitatively we recover the same trends no matter which 
set of IMF-sensitive features are used. In particular, in all 
cases we find evidence for IMF variation. This provides 
strong confirmation that our basic result is not influenced by 
a single spectroscopic feature. There are however noticeable 
offsets between the various permutations, especially in the up- 
per left panel of Figure [12] This means that in detail the var- 
ious IMF-sensitive features are favoring different values of 
M/Lk, which in turn suggests that there are residual system- 
atics in the modeling. One possible explanation lies in the fact 
that these features are each most sensitive to a different range 
of stellar masses (see CvD12 for details). The data may be 
demanding an IMF that is more complicated than the broken 
power-law that we adopt. These issues will be explored in 
future work. 

We have also considered a model where the effect of the 



IMF on the spectrum is limited to the red (A > 0.8/im). In 
this model the blue spectrum is completely insensitive to IMF 
variation. Our goal here is to assess the extent to which the 
IMF spectral signatures in the blue are driving the IMF con- 
straints. The result is shown in the bottom left panel of Figure 
fl2l The M/Lk ratios in this case are again similar to the fidu- 
cial model, with a scatter between the two of 30%. The sizable 
scatter suggests that there is in fact some information on the 
IMF in the blue spectral range, which in retrospect should not 
be surprising (see e.g., Figure 10 in CVD12). However, to the 
extent that the blue spectral region is sensitive to the IMF, it 
is almost certainly sensitive to a different stellar mass range, 
since the coolest M dwarfs have such red spectral energy dis- 
tributions. 

In the lower middle panel of Figure Q~2] we consider an ex- 
treme model in which the IMF variation in the blue spectral 
region is suppressed and the Nal and CaT spectral features are 
masked from the fit. In this case IMF constraints are due only 
to the Wing-Ford band. The scatter between this model and 
the fiducial is large. Evidently, over the range of IMFs probed 
in our fiducial model, the Wing-Ford spectral region is not a 



The Stellar IMF in Early-Type Galaxies. II. Results 



13 



particularly powerful probe of the IMF when considered in 
isolation. This is also evident in Figure 10 of Paper I, which 
shows that the relation between the Wing-Ford band and a 
has considerable scatter. It can however be a useful probe 
of the IMF in conjunction with other features, as illustrated 
in the upper panels of this figure. As discussed extensively in 
CvD12, the Wing-Ford band is most sensitive to the very low- 
est mass stars, and so the relatively large scatter between the 
IMF inferred from the Wing-Ford band alone and the fiducial 
model may be pointing toward the need to consider additional 
flexibility in the model IMF at low masses. 

We also consider variation in the underlying model by re- 
moving the three nuisance parameters in the fiducial model, 
including log(M7III), log(/h ot ), and 7h ot . This is shown in 
the lower right panel of Figure Q~2] Here again the qualita- 
tive trend of IMF variation is robust. In detail however there 
is a fair degree of scatter between this model and the fiducial, 
and the typical M /Lk is ~ 30% higher when the nuisance 
parameters are removed. The most important nuisance pa- 
rameter driving these differences is log(M7III). Galaxies for 
which this parameter is close to zero have very similar M /Lk 
regardless of whether or not the nuisance parameters are in- 
cluded. In contrast, those galaxies which have the largest 
values of log(M7III), of order -1.0, show the largest differ- 
ences in M /Lk, of approximately 50%, when the nuisance 
parameters are or are not included. More detailed modeling of 
the cool giant contribution to the observed spectra will be re- 
quired before more accurate IMF constraints can be obtained. 

The results of this section demonstrate that our primary re- 
sult, that the IMF varies from galaxy to galaxy, is robust to a 
variety of model permutations. There are however systematic 
uncertainties in the mass-to-light ratios at the m 50% level at- 
tributable to the choice of the parameter set and wavelength 
coverage. To first order, this systematic uncertainty applies 
equally to all mass-to-light ratios, so that relative trends are 
quite robust. Further work is needed to address the remaining 
sources of systemtatic uncertainty in the model. 

6. DISCUSSION 
6.1. Is IMF Variation Really Necessary? 

One of the key goals of this paper has been to demonstrate 
that IMF variation is not only plausible but necessary given 
the data and a very flexible SPS model. We have employed 
a model that allows for variation in 1 1 elements and an ad- 
ditional 6 parameters (not including the IMF) meant to cap- 
ture a wide range in possible underlying stellar populations. 
Nonetheless, our model is obviously not exhaustive, and so 
one may wonder whether additional components not included 
in our model may mimic the effect of IMF variation. Ulti- 
mately this question must be addressed with further quanti- 
tative modeling; in this section we limit ourselves to more 
general statements. 

The most significant argument favoring IMF variation is 
that the three classic IMF-sensitive spectral features — Nal, 
CaT, and FeH — are all telling a broadly consistent story 
(as evidenced by the upper panels of Figure [T2b. Any model 
that is proposed to explain away the purported IMF variation 
must simultaneously decrease the strength of CaT, increase 
the strength of Nal, and also explain the strength of the Wing- 
Ford band. It is not easy to imagine a single population that 
can satisfy these constraints. 

As an example, cool stars have strong TiO bands through- 
out their spectra. In particular, the 2-3 band of the 8 sys- 



tem of TiO coincides with th e FeH feature at 0.99/j.m (se e 
the supplementary material in Ivan Dokkum & Conrovll2010l) . 
while the 1-0 band of the 5 system coincides roughly with 
the Nal feature at 0.82/ira. An increase in the contribution 
of late M giant light may therefore increase the strength of 
the Wing-Ford band and the Nal feature. We have included a 
cool M giant in our model as a nuisance parameter, but one 
could argue that other, perhaps more peculiar cool stars exist 
in early-type galaxies. This scenario is however strongly con- 
strained by the central wavelength of the Nal feature, which 
becomes systematically bluer with increasing velocity disper- 
sion (see Paper I). The reason for this is because, while the 
Nal and TiO features roughly coincide, the former has a bluer 
central wavelength than the latter. If TiO was the cause of the 
strong Nal and Wing-Ford features, then the centroid of the 
Nal feature would become redder, not bluer with increasing 
velocity dispersion. Furthermore, the strength of TiO bands is 
constrained from other, cleaner, spectral regions including the 
0-0(5 TiO bandhead at 0.88 -0.89fim. In fact, when both the 
Nal and CaT features are masked, a strong constraint on the 
IMF is still obtained because the 0.88 -0.89/ura TiO bandhead 
constrains the possible contribution from TiO to the Wing- 
Ford band. 

One may also wonder if, rather than an increase in low-mass 
stars, one could explain the observations with a decrease in 
the number of giant branch stars. These two options would be 
at least approximately equivalent were it not for the turn-off 
stars, which provide a benchmark against which one can con- 
strain both the relative number of late-type giants and dwarfs. 
Here again the Wing-Ford band provides a useful diagnos- 
tic. As demonstrated in CvD12, to some extent (and ignor- 
ing constraints from other spectral regions), the strength of 
the Nal and CaT features can be equally well fit with an in- 
crease in dwarfs or a decrease in giants. But this is not true 
for the Wing-Ford band, which becomes strong only in the 
latest M dwarfs due to FeH absorption. For a MW IMF, these 
M dwarfs are too faint to contribute any signal to the Wing- 
Ford band, even when luminous giants are removed from the 
model. In fact, decreasing the number of giants in the model 
actually decreases the strength of the Wing-Ford band for a 
MW IMF. This occurs because the Wing-Ford band is also in- 
fluenced by TiO absorption, which decreases as the number 
of giants decreases. 

Another concern is related to the reliability of the mod- 
els at high metallicity. Recall that we are not explicitly al- 
lowing the isochrones to vary with metallicity. Instead we 
have adopted an additional free parameter, A(r e ft), which al- 
lows us to mimic shifts in the isochrones by varying r e ff for 
all the stars in the model. It is important to realize the to- 
tal variation in metallicity across our sample is not large: 
[Z/H] varies from « 0.0 to sa 0.25 with a median of 0.15 
(see Figure [6j. There are many galaxies in our sample with 
[Fe/H]« -0.1 and [Mg/Fe]« 0.2 and thus [Z/H]« 0.1. Using 
the Dartmouth Stellar Evolution Database^, we have created 
isochrones with [Fe/H]= -0.1 and [Mg/Fe]= 0.2 and find that 
these isochrones are almost identical to the solar metallicity, 
solar-scaled isochrones that we adopt for our base model. The 
most metal-rich galaxies in our sample have isochrones that 
may be 50- 100 K cooler than those used in our base model. 
Figure [6] reveals a weak correlation of IMF with [Z/H], and, 
importantly, at [Z/H] > 0. 1 the inferred IMFs range from MW- 
like to steeper than Salpeter. This suggests that any model 

7 http://stellar.dartmouth.edu/'^models/ 




Figure 13. IMF versus inferred SF timescale (left panel), inferred SFR surface density, Esfr (middle panel), and inferred pressure (right panel). All quantities 
are measured within R e /8 except for M31, where quantities are measured within the central 4" = 15 pc. The quantity plotted on the y-axis is sensitive only to 
the IMF, as it is the true mass-to-light ratio divided by the mass-to-light ratio assuming a MW IMF. The SF timescale is inferred from the measured [Mg/Fe] 
abundance. The SFR surface density is estimated from the stellar mass, SF timescale, and present galaxy size, and the pressure is estimated from the mass and 
present galaxy size. Symbol colors are as in Figure|5] Solid lines are not fits to the points, they are simply intended to guide the eye. 



systematics that correlate with metallicity are not driving the 
inferred IMF variation with a and [Mg/Fe] shown in Figure 
[5] Ultimately, more work is needed in the construction and 
calibration of models at high metallicity. 

It will be difficult to prove unequivocally that the IMF does 
indeed vary from galaxy to galaxy based on integrated light 
measurements. However, it presently remains the best expla- 
nation for the observed spectral features in early-type galaxies 
given the available models. 



6.2. Origin of the Observed Trends 

If we now take the inferred IMF variation at face value, 
we can ask what physical mechanism(s) may give rise to the 
observed correlations. The observed correlation between the 
IMF and a-enhancement appears to be stronger than the cor- 
relation with a; we will therefore interpret the former as the 
more fundamental relation. Of course, nearly all properties 
of early-ty pe galaxies are strong l y correlated with on e an- 
other (e.g..|Fa ?er & Jacksonll976t lDiorgovski & Davis 



Worthev et a 
Graves et al.l 



119921: iTrager et al.fl2000HThomas et" 



2009b . so further work will be required to posi- 



tively identify the fundamental underlying variable(s) govern- 
ing the variation in the IMF. 

The level of a-enhancement in a stellar population is 
normally interpreted in terms of a star formation (SF) 
timescale — higher a-enhancements correspond to shorter 
SF timescales. We adopt a relation between [a/Fe] and SF 
timescale based on a simple chemical evolution model pre- 
sented in iThomas et alj (120051) . [o/Fe]« ± - glogAf, and in 
this section we assume that [Mg/Fe] = [a/Fe]. We caution that 
the precise relation between timescale and a-enhancement 
depends on the stellar population mod el and the det ails of 
galactic chemical evolution (e.g., Arri goni et al.| [2010). The 
resulting relation between the IMF and SF timescale is shown 
in the left panel of FigureQj] Notice that the galaxies with the 
most bottom-heavy IMFs have inferred SF timescales of only 
200-300Myr. 

We can go one step further and estimate an average star for- 
mation rate (SFR) surface density, Esfr, within R e /% based on 



the stellar mass within this radius^, M*, and the SF timescale: 
Ssf = M*/47rAf(fl e /8) 2 . The result is shown in the mid- 
dle panel of Figure Qj] Galaxies with the most bottom- 
heavy IMF have inferred SFR surface densities in excess of 
100MQyr _1 kpc~ 2 . In the local universe such high SFR sur- 
face densities ar e found only in t he most extreme circumnu- 
clear starbursts (Ke nnicutill 19981) . We emphasize that these 
quantities are based on the present stellar density within R e /8. 
Observations of massive early-type galaxies s uggest that their 
centra l densities actually decrease with time (Bezanson et al. 
2009), which a ppears to be a c onsequence of both major and 
minor mergers (Oser et al. 2012). These inferred SFR surface 
densities may therefore be lower limits to the true SFR densi- 
ties. 

Another quantity of interest is the pressure of the system, 
which can be estimated via P oc Ml/R 4 . We have computed 
the effective pressure within /? e /8 for our sample and plotted 
this against our best-fit IMFs in the right panel of FigureQj] A 
correlation is apparent, but it is weaker than the other relations 
shown in this figure. This may be due to the fact that the 
effective pressure estimated at z = is only weakly correlated 
with the pressure at the epoch of formation. 

In each of these panels we have included simple power- 
law relations. These were not fits to the data; the power-law 
indices were chosen by eye to represent the mean trend in the 
sample. 

The data therefore support a scenario wherein the IMF is 
correlated with the intensity of star formation and/or the effec- 
tive pressure of the system, in the sense that higher SFR den- 
sities and higher pressures correspond to more bottom-heavy 
IMFs. 

A number of recent papers have pointed to fragmen- 
tation in supersonically turbulent molecular clouds as the 
key physi cal process g overnin g the shape of th e stel- 
lar IMF dPadoan et al.l Il997h IPadoan & Nordlundl l200l 

8 The stellar mass within R e /8 is estimated b y combining the best-fi t 
M/Lg ratios, the total K-b and luminosities from Capoellari et al. J2011I) . 



and the Sersic indices from Falcon-Barroso et al. (2011). The latter quan- 
tity is used to estimate the fraction of the total light contained within R e /&. 
For M3 1 the /-band luminosity within our extraction aperture was derived 
from Sick et al., in prep. 
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iHennebeUe & Chabrieri 120081: iHopkinsI l20T2aUbh . In this 
framework the Mach number within the molecular cloud is 
a key variable affecting the shape of the IMF. Qualitatively, 
and for all other variables held fixed, a higher Mach number 
results in a more bottom-heavy IMF. 

As noted above, local circumnuclear starbursts may be 
analogs to the most massive early-type galaxies in our sample 
at their formation epoch. The local starbursts have very high 
SFR densities and have molecular gas that is on average hotter 
and more super sonically turbulent than the molecular gas in 
the M ilky Way dDownes & Solomonll998l:lBryant & Scoviliel 
1999). The higher Mach numbers in starbursts may be the re- 
sult of th e higher rate of supernovae (SNe) in these systems. 
IHopkinsI d2012bl) has combined these ideas with an analytic 
theory for the IMF to demonstrate that such galaxies should 
have bottom-heavy IMFs. The Mach number is critical to this 
conclusion: the thermal Jeans mass (My oc j o~ I//2 , where c s 
and p are the sound speed and density, respectively) in star- 
bursts is frequently larger than in the MW disk, and thus a 
simple Jeans argument would sug gest that the IMF in star- 
bursts should be bottom-light (e. g., [Larson 1998; Baug h etaO 
12005b Narayanan & Pavel 120 12h . The larger Mach numbers 
in starbursts overcomes the effect of the increasing thermal 
Jeans mass to result in a lower characteristic mass in the IMF. 

The basic idea is that galaxies with higher sustained Ssfr 
have a rate per unit volume of SNe, which can drive more 
highly supersonic turbulence, resulting in a higher typical 
Mach number in molecular clouds that in turn lowers the 
characteristic mass of the IMF. Notice that this picture does 
not imply that all a-enhanced systems have bottom-heavy 
IMFs. The Ssfr must be sustained long enough for the SNe 
to drive turbulence. In particular, globular clusters, which are 
a-enhanced systems, would not be expected to have bottom- 
heavy IMFs because star formation in such systems is approx- 
imately instantaneous. 

We conclude this section with the caution that isothermal- 
ity, a key assumption in the models mentio ned above , likely 
does not hold in the regimes of interest. Kjum holzl (1201 ll) 
has argued instead that radiative feedback is the key physical 
process setting the characteristic mass of the IMF, and that 
the Jeans mass (whether thermal or turbulent) plays no direct 
role. In Krumholz's model the ISM pressure is the key vari- 
able, and there may be some evidence for IMF variation with 
pressure in Figure [131 

7. SUMMARY 

In this paper we have confronted high-quality absorption 
line spectra of 38 early-type galaxies and the nuclear bulge of 
M3 1 with a new stellar population synthesis model that incor- 
porates flexible abundance patterns and IMFs. These data and 
models extend beyond 1 fim, where IMF-sensitive absorption 
features allow for a strong constraint on the IMF and stellar 
mass-to-light ratio within individual galaxies. The data sam- 
ple the inner regions of the galaxies (to R e /%) and so conclu- 
sions regarding the stellar populations of these galaxies apply 
strictly to these inner regions. We now summarize our main 
results. 

• Evidence is found for an IMF that varies systematically 
with galaxy velocity dispersion and a-enhancement. 
Steeper (more bottom-heavy) IMFs are found in more 
massive systems. The best-fit mass-to-light ratios do 
not violate dynamical constraints. 

• At the highest velocity dispersions and 



a-enhancements the IMF becomes steeper than 
even the canonical Salpeter IMF, with inferred Zf-band 
mass-to-light ratios a factor of w 2 higher than would 
be expected for a universal, Milky Way IMF. 

• Systematic uncertainties in the models translate into 
< 50% uncertainties in the derived mass-to-light ratios, 
while the median statistical uncertainty is w 7%. In par- 
ticular, there appears to be some tension between the 
models and data regarding the CaT feature, which is in- 
teresting because CaT is strong in giants while the oth- 
ers are strong in dwarfs. We have demonstrated that the 
tension around the CaT feature is visible for all of the 
galaxies in our sample and suggests that the relatively 
poor modeling of the CaT feature does not strongly in- 
fluence the derived M/L ratios. 

• These results are consistent with a scenario wherein 
the IMF becomes increasingly bottom-heavy as the SF 
timescale becomes increasingly short, the SFR sur- 
face density becomes increasingly high, and/or the ISM 
pressure becomes increasingly high. These trends are 
broadly consistent with several recent conjectures for 
the origin of the IMF, but more detailed models are 
needed before conclusive statements can be made. 
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APPENDIX 

In an online-only appendix we provide the fits to all galaxies included in Paper I (excluding NGC 4621 and NGC 524, which 
are shown in Figures 1 and 2 in the main text). 



